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Abstract 

We discuss the origin of multiscaling in financial time-series and investigate how 
to best quantify it. Our methodology consists in separating the different sources 
of measured multifractality by analysing the multi/uni-scaling behaviour of syn¬ 
thetic time-series with known properties. We use the results from the synthetic 
time-series to interpret the measure of multifractality of real log-returns time- 
series. The main finding is that the aggregation horizon of the returns can 
introduce a strong bias effect on the measure of multifractality. This effect can 
become especially important when returns distributions have power law tails 
with exponents in the range [2,5]. We discuss the right aggregation horizon to 
mitigate this bias. 

Keywords: multiscaling, multifractality, Central Limit Theorem, power law 
tails, autocorrelation. 


1. Introduction 


The multifractal behaviour of the financial time-series has become one the 



Many works 
nil, re porting 


acknowledged stylized facts in the literature (see: [li 
have been dedicated to its empirical characterization [6 
strong evidence of its presence in financial markets, and models [12l . |1 

EEliilii. 

Understanding which is the origin of the measured multifractality in finan¬ 
cial markets is still an open research challenge. This question has been raised 
first in where the authors pointed out that the power law tails and the auto¬ 
correlation of the analysed time-series must be the two sources of the measured 
multifractality. In the first case, the multifractal behaviour is a consequence 
of the broadness of the unconditional distribution of the returns; while in the 
second case, the multifractal behaviour is associated with the causal structure 
of the time-series. After [^ . many papers have investigated the relative contri¬ 
bution of these two sources to the measured multifractality, however no agree¬ 
ment exists. For example in 2l| the author points out that the autocorrelation 
structure has a minor impact on the measured multifractality while the power 
law tails are the major source of it. In [2^ they also report that the power 
law tails give the major contribution, but they also point out that the pres¬ 
ence of unknown autocorrelations might introduce a negative bias effect in the 
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quantification of multifractality. Conversely, in [ 2 ^ the authors find that the 
autocorrelation gives the major contribution while for a specific time-series the 
“extreme events are actually inimical to the multifractal scaling”. This lack of 
agreement motivated our work, leading us to investigate what the source of the 
measured multifractality is and how it can be detected. 

In this paper we quantify the two contributions by using synthetic times 
series where the two contributions can be separated. Specifically we analyze 
Brownian Motion with innovations drawn from a t-Student distribution. Multi¬ 
fractal Random Walk and normalized version of the Multifractal Random Walk. 
The measured multifractality on these synthetic series are compared with mea¬ 
sures on both real financial log-returns and on a normalized version of the real 
log-returns where the heavy tails are removed. The results show the aggregation 
horizon has a strong effect on the quantification of multifractality. We verify 
however that there are regions of the aggregation horizon that can be used in 
practice to extract reliable multifractality estimators. 

The rest of the paper is organized as follows: in Sec. [2] we perform a brief 
literature review introducing the tools we used for our analysis and discussing 
the results from previous works. In Sec. [3] we review the theoretical models we 
used and we define the multifractality estimators that shall be used throughout 
the paper. Secs. 0] and [S] are dedicated respectively to the analysis of artificial 
and real data. In Sec. [6] we discuss the results while in Sec. [7] we summarize 
the results and conclude. 


2. Background 


2.1. Multifractality 

Among the methods which are used for the empirical measurement of the 
scaling exponents, in this work we will use only the Generalized Hurst Exponent 
method (GHE), see [mo, 00 which relies on the measurement of the direct 
scaling of the gth-order moments of the distribution of the increments and it 
has been shown to be one of the most reliable estimators [ 0 ] . Let us call X {t) 
a process with stationary increments. The GHE method considers the following 
function of the increments 


E[\X{t + r)-Xim=K{q)T'^^^'^\ ( 1 ) 

where r is the time horizon over which the increments are computed and H (q) 
is the Generalized Hurst Exponent. The function f{q) = qH{q) is concave and 
K{q) depends also on q. In particular, GHE considers the logarithm of Eq. (0]) 

In {E\\X{t + r) - X{t)n) = C(g) ln(r) + In {K{q)) , (2) 

and, if linearity with respect to ln(r) holds, it computes the slopes of the straight 
lines at different q. The slopes are computed in the following way: for every q, 
several linear fits are computed taking r G [TminMmax]^ with usually r^m = 1 
and several values of r^ax typically between [5,19]; the output estimator for 
C,{q) is the average of these values for a given q. This method gives also the 


^The code can be found at http://www.mathworks.com/matlabcentral/fileexchange/30076-generalized-hurst-exponent 
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errors which are the standard deviations of these values. However, in this paper 
we do not perform any average over different values of Tmin, Tmax and we instead 
consider just one linear fit for a given range r € [TmimTmax]- In particular we 
focus our attention on two ranges, namely r G [1,19], following the prescription 
of other works ( ^ 2^ 22|), and r G [30, 250]. The reason for this simplification 
is that, given a range of r, we did not want to weight more the small values 
with respect to the big values. This point will be further stressed later in the 
paper. 


2.2. Source of multiscaling in financial data: state of the art 

As already mentioned in the Introduction, there is a debate in literature 
concerning what property of the financial time-series contributes mostly to their 
observed multiscaling behaviour. Let us here discuss some findings present in 


the literature. In [2]| the author studied the Dow Jones Industrial Average 


taken on a daily basis and processed the data in four different ways in order 
to uncover the source of the multiscaling behaviour. The methods used were 

([HI): 

1. shuffling the data in order to check the impact of the shape of the uncon¬ 
ditional distribution; 

2. building up surrogate data with the same unconditional distribution and 
linear correlation of the empirical one but with any non linear correlation 
removed; 

3. cutting the tails by substituting the more extreme events with resampled 
ones from the core of the distribution; 

4. generating surrogate power law-tailed time-series, namely double Weibull 
and t-Student, preserving the temporal structure of the empirical time- 
series. 


The author found that, on one hand the temporal structure, both linear and 
non linear, has a minor impact. On the other hand, the fatter the tails are, the 
stronger the multiscaling. And this result was confirmed both by cutting the 
extreme events and changing the unconditional distribution. 

In [ 2 ^ the authors studied again the Dow Jones Industrial Average taken 
on a daily basis plus the Dow Jones Euro Stoxx 50 sampled at one minute. In 
this case three analysis were performed: 

1. shuffling the whole dataset; 

2. dividing the dataset into intervals and shuffling them in order to keep short 
memory contributions then repeating the analysis changing the length of 
the intervals; 

3. cutting the extreme events. 

The authors found that when shuffled, the dataset loses its multiscaling be¬ 
haviour (Hi). The shuffling of the intervals showed that the linearity of the 
scaling of the fluctuation functions worsen when the length of the interval is 
small and improves increasing it, thus according to the authors this should be 
regarded as a sign that the temporal correlations are the source of multiscaling. 
For what concerns the cut of the most extreme events they found that for the 
Dow Jones Industrial Average extreme events have no particular impact, while 
for the Dow Jones Euro Stoxx 50 they cause a distortion in the Singularity 
Spectrum. 
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Finally in an extensive analysis was conducted on several empirical time- 
series including stock market indexes, exchange rates and interest rates. In 
order to unveil the source of the empirical multiscaling, the shuffling method 
was used plus a comparison with synthetic data. The authors also found an 
increase of the measured multiscaling of the shuffled time-series which then led 
them to draw two conclusions: first that the major source of the multifractality 
comes from the power law tails of the distribution; second that the presence of 
time correlations decreases the multifractality. These conclusions are consistent 
with the analysis of the Markov Switching Multifractal Model ([l3])- Further 
analyses have been conducted by means of fractional Brownian motions, random 
walks with steps drawn from a Levy distribution and ARFIMA processes, all 
confirming the results found on the empirical datasets ('( 2 ^'). 


3. Models and methods 

In this section we describe the analytical properties of the models we used 
for our analysis and the variable we chose to detect the multifractality. 


3.1. Brownian motion with t-Student innovations (tBM) 

We considered a uniscaling process with independent increments drawn from 
a t-Student distribution. Introducing the durnmy variable t, the probability 
density of a t-Student distribution is given by l[28l|l 


P{t) 




n 




(3) 


where n is the number of degrees freedom which can be non-integer. According 
to Eq. m the variable t has mean zero if n > 1 and infinite otherwise. The 

Tl 

variance is instead equal to - if n > 2, infinite if 1 < n < 2 and undefined 

n — 2 

otherwise. The spectrum of a tBM can be computed analytically in both cases, 
either if n is bigger or smaller then two. For n < 2 the t-Student distribution of 
Eq. ([3]) behaves as a stable distribution with skewness parameter equal to zero 
and stability parameter equal to n, so the scaling exponents are (see |2d . l29ll30l | rl 


({q)=qH{q) = ^ ifg<n. (4) 

Eor n > 2 and finite aggregation horizon r it can be shown that 

E[\X{t + r)-Xm=f{q)ri. (5) 


Thus 

C{q)=qH{q) = ^ ifg<n. (6) 

It is expected then that for n > 2 the scaling exponents are identical to the one 
of a BM up to g = n. Eor n = 2, it can be proved rigorously that the scaling 
exponents behave like Eq. (ini) (cfr. [Slj). 

According to these analytical observations a tBM is a unifractal process both 
for n < 2 and n > 2 and ({q) behaves as a straight line. 


^In is reported the shape of the scaling exponent for g > n to be equal to one. However, 
as underlined in [29| and fSQj . this so called bifractal behaviour is a pure finite size sample 
effect. 
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3.2. Multifractal Random Walk (MRW) 


Among the models proposed in the literature, in the present paper we chose 
as a benchmark multifractal model the so-called Multifractal Random Walk in¬ 
troduced in [17| . Its main appealing property is that it has exactly computable 
scaling exponents. We report that this model has been further developed and 
alternative multifractal random walks models with different scaling exponents 
has been proposed (see 18, I^), however for our purposes the statistical prop¬ 
erties of this original model are sufficient. In the discrete version, the process 
X{t) described by the model is defined as ( 0 ) 


t 

in 




fe=i 


so the increments can be written as 


Tt {t) = X{t + T) - X {t) 




fe —+1 


(7) 


( 8 ) 


with EAt ~ cr^At), WAt ~ N{—X'^ ln(L/At), ln(A/At)), where A is called 
intermittency parameter, L is the autocorrelation len gth , a is the variance of 
the overall process and At is the discretization step ([17|). The peculiarity of 
this model is that, while the e^tik) are independent, the uJAtik) are not, having 
autocovariance ([IZj): 

Cov{u;At{ki),uJAtik2)) = A^ lnpAt(fci - ^ 2 ), (9) 

with 

PAi(fci-fc2) = | (|A:i-fc2| + l)At \f^i-k2\<L/At, 

I 1 otherwise. 

The scaling exponents of this model in the continuous limit are (0): 

C(g) = giJ(g) = -yg^ + (A^ + i)g. (11) 

The importance of this model relies in the fact that, by means of just three 
parameters (A, L, cr), it exhibits both power law tails and volatility clustering, 
keeping its plain innovations uncorrelated. In particular the intermittency pa¬ 
rameter A determines both the power law tails, which decay with an exponent 
proportional to A^ (HI), and the decay of the autocorrelation functions of the 
powers of the absolute returns, whose decaying exponents are again proportional 
to A2 ([13). 


3.3. Multifractality estimator 

In order to understand the behaviour of the scaling exponents C{q), we used 
the Generalized Hurst Exponent, H{q) (see Eqs. (HI and dH). Let us note that 
due to the presence of the power law tails in the empirical datasets ([H, [s^l), the 
value of q should be less then the tail exponent of the analysed time-series, since 
the moments are not finite for large q. Moreover, the existence of a moment 


5 






does not guarantee its measurement on finite samples to be reliable when its 
variance is not finite. Following these observations, along with the fact that the 
decay exponents of the empirical power law tails typically range between two 
and five (@), in our analyses we limited ourselves to g < 1. In particular we 
took a range of q between 0.1 and 1 every 0.1 units, having 10 points in totaj^. 
To assess the presence of a statistically meaningful curvature in the scaling 
exponents, thus multiscaling, we performed a parabolic fit over the range g < 1 
and then we took the coefficient of the second degree term as a multiscaling 
estimator, i.e. 

C{q) = qH{q) Bq"^ + Aq + const, (12) 

where then B is the multifractality estimatoi0 we adopted in this paper. It 
must have negative (multiscaling behaviour) or zero (uniscaling behaviour) ex¬ 
pectation value (due to concavity). The expected value of the parameter const 
is zero and in our measurements of C(g) we always checked this condition for 
consistency. Note that in the authors fit the Singularity Spectrum, with 
a fourth degree polynomial which implies necessarily a fourth degree polyno¬ 
mial functional form for C('z)- However, for our purposes, a second degree fit is 
enough and we verified that the inclusion of the terms up to the fourth degree 
does not modify our results. 

4. Analysis of artificial data 

We started our analysis simulating 10^ MRW processes, specified in Subsec. 
13.21 made of 10® steps 5t with parameters = 0.03, L = 1000, <7 = 1 and 
computing the mean and the standard deviation of B, H{0.5) and H{1) over 
the realizations. We then repeated the measure over the shuffled version of the 
time-series. The convergence of the estimators has been always checked. The 
values of and L have been chosen according to empirical analyses conducted in 
other works (see for example [H), while the length has been chosen to reduce as 
much as possible the finite size sample errors keeping reasonable computational 
times. The results are reported in Tabs. [T]and[2]with respectively r € [1,19] and 
T G [30,250]. The theoretical values are reported in boldface within brackets 
under the measured values. 


MRW 

plain 

shuffled 

B 

-0.0090 ±0.0006 

(-0.015) 

-0.0273 ±0.0006 
(0) 

iJ(0.5) 

0.514 ±0.001 

(0.5225) 

0.541 ±0.001 

(0.5) 

ff(l) 

0.509 ±0.001 

(0.515) 

0.527 ±0.001 

(0.5) 


Table 1: Comparison between B, and H{1) for a plain and a shuffled 

MRW with T G [1,19]. 


®We checked that increasing the number of points over the interval does not change the 
results. 

“^The notation of the hat means the estimator of the quantity under it. 


6 









MRW 

plain 

shuffled 

B 

-0.014 ±0.002 

-0.002 ±0.002 

(-0.015) 

(0) 

H{0.5) 

0.521 ±0.005 
(0.5225) 

0.503 ±0.005 

(0.5) 

H{1) 

0.514 ±0.005 

0.502 ±0.005 

(0.515) 

(0.5) 


Table 2: Comparison between B, H{0.5) and H{1) for a plain and a shuffled 
MRW with T G [30,250]. 


It is evident from the Tables that in the region r G [1,19] also for MRW 
the non lineariW of the scaling exponents increases after shuffling confirming 


the results of [2^, while in the region r G [30,250] this effect disappears and 


the shuffled process seems statistically undistinguishable from a BM. According 
to its definition (see Sec. [I, a shuffled MRW is an uncorrelated, symmetric 
time-series with power law tails. In light of this, a model which might give us 
some further indication is a tBM. In the next subsection we focus on this model. 


4-.1. The effect of the power law tails 

Let us here report the estimators B, H{0.5) and H{1) in the presence of 
power law tails. In Fig. [T] we report the results of the computation of the 
scaling exponents (^{q) for r G [1,19] of single realizations of processes with t- 
Student innovations made of 10® steps, for various values of n: n G [1, 5] every 
0.5 units (cfr. Eq. [S]). In blue solid line the measured scaling exponents of the 
synthetic time-series are reported, whereas in dashed red line the theoretical 
expectation (see Eqs. and (tni). It is evident that as soon as n moves away 
from 1, a curvature of f{q) arises. But it is also evident that, as the tail index 
increases above n = 2 the graphs become more linear with apparent linearity 
almost recovered above n = 5. It is worth noting that the empirically measured 
tail indexes fall exactly in the range [2, 5], which is the most numerically biased. 
In order make a quantitative assessment, for each value of n = 3,4,5, which 
roughly covers the range of empirically observed tails, we simulated 10^ tBM 
made of 10® steps and we computed the mean and the standard deviation of B, 
H{0.5) and H{1) for all values of n. Tabs. [3] and 0] report the numerical results 
for respectively r G [1,19] and r G [30, 250] (theoretical values in boldface within 
brackets under measured values). 


tBM 

n = 3 

n = 4 

n = 5 

B 

-0.0364 ±0.0007 

-0.0251 ±0.0005 

-0.0186 ±0.0005 

(0) 

(0) 

(0) 

77(0.5) 

0.570 ±0.001 

(0.5) 

0.544 ±0.001 

(0.5) 

0.531 ±0.001 

(0.5) 

77(1) 

0.552 ±0.001 

0.531 ±0.001 

0.522 ±0.001 

(0.5) 

(0.5) 

(0.5) 


Table 3: Mean and standard deviation of B, H{0.5) and H{1) computed on 
t-Students time-series with n = 3,4, 5 and r G [1,19]. 
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tBM 

n = 3 

n = 4 

n = 5 

B 

(-9± 2) • 10-^ 
(0) 

(-4 ±2)- 10"^ 

(0) 

(-2± 2) • 10“^ 

(0) 

iJ(0.5) 

0.517 ±0.005 

(0.5) 

0.506 ±0.005 

(0.5) 

0.503 ±0.005 

(0.5) 

^(1) 

0.513 ±0.005 
(0.5) 

0.504 ±0.005 
(0.5) 

0.502 ±0.005 

(0.5) 


Table 4: Mean and standard deviation of B, H{0.5) and H{1) computed on 
t-Students time-series with n = 3,4, 5 and r € [30, 250]. 


Let us note that in the range r G [1,19] with a significance level of 1%, a 
multiscaling behaviour is found due to the presence of power law tails in all cases, 
while in the range t G [30, 250] only the case n = 3, keeps its concavity at 1% 
significance level, but still very lowered with respect to the other region. Thus 
the measurements in the latter region seem to agree better with the theoretical 
uniscaling behaviour. 
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Figure 1: Numerical values of ({q) (blue line) against its theoretical values (red 
dashed line) for a tBM with n = [1,1.5, 2, 2.5, 3,3.5,4,4.5, 5] taken every 0.5 
units, in increasing order from left to right and top to bottom. 


4--2. Ejfect of autocorrelations 

In order to isolate the contribution of the autocorrelation and eliminate 
the effect of the tails, we applied a normalization procedure to the MRW. The 
method consists in changing the unconditional distribution of a time-series into a 
desired one preserving its causal structure as proposed in 21| . Before we proceed 


























































we need to stress a detail. If the empirical time series has power law tails while 
the surrogate is normally distributed, the autocovariance of the second one has 
the same functional form of the first one, but its strength is lowered. This can 
be simply ascribed to the fact that the extreme events give a big contribution in 
the computations of the averages, thus normalizing them reduces the strength of 
the correlations at each lag. This effect can be easily seen by plotting in semilog 
scale on the same figure the function proposed in 32| for the estimation of the 
parameters of the MRW for a MRW and its normalized version (nMRW). This 
is shown in Fig. [5] in semilog scale where we observe that the autocovariance of 
the original time-series follows well the theoretical behaviour ([s^) 


C{T) = Cov [ln|r^(t-bT)|,ln|r^(<)|] = In 


L 


T+1 


(13) 


whereas the normalized one has a smaller effective value of A. It is evident 
that the slope of the line relative to the normalized process is smaller than the 
slope of the line relative to the plain one (in absolute value). The behaviours of 



In(lags) 


Figure 2: Autocovariance function of the log absolute returns for a plain (blue) 
and normalized (red) path drawn from a MRW made of 10® steps with A = 0.3, 
L = 1000, cr = 1. 


the scaling exponents C(<z) for r € [1,19] of single realizations of nMRWs made 
of 10® steps for different degree of autocorrelation A, specified in the captions, 
L = 1000 and cr = 1 are reported in Fig. [31 As noted before the effective value 
of A after the normalization is a bit lower then the one reported in the captions, 
so the theoretical line is plotted recomputing the value of A over the normalized 
processes. 
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Figure 3: Numerical values of C(g) (blue line) against its theoretical values (red 
dashed line) for a nMRW with A = [0.1,0.2,0.3,0.4], in increasing order from 
left to right and top to bottom. 


We observe that, in all cases, the function (^(g) changes its concavity. In order 
to make a quantitative assessment, for each value of = 0.03, 0.04, 0.05, we 
simulated 10^ MRWs made of 10® steps, we normalized them and we computed 
the mean and the standard deviation of B along with iF(0.5) and H{1). Tabs. 
El and m report the numerical results for r S [1,19] and r S [30,250] together 
with the theoretical expected values in boldface under the measured one. The 
effective value of A, called Ae// in the table, which affects B, H{Q.5) and H{1), 
was obtained from Eq. m by fitting the autocovariance of each normalized 
time-series taking then the mean and the standard deviation. 



A^ = 0.03 

A^ = 0.04 

A^ = 0.05 

^iff 

0.0223 ±0.0005 

0.0279 ±0.0007 

0.0330 ±0.0008 

B 

0.0075 ±0.0005 

(-0.0111 ±0.0003) 

0.0085 ±0.0005 

(-0.0139 ± 0.0003) 

0.0093 ±0.0006 

(-0.0165 ±0.0004) 

ff(0.5) 

0.489 ±0.001 

(0.5167 ± 0.0004) 

0.487 ±0.001 

(0.5209 ±0.0005) 

0.486 ±0.001 

(0.5247 ±0.0006) 

H{1) 

0.492 ±0.001 

(0.5111 ± 0.0003) 

0.491 ±0.001 

(0.5139 ±0.0003) 

0.490 ±0.001 

(0.5165 ±0.0004) 


Table 5: Mean and standard deviation of B, H{0.5) and H{1) computed on 
nMRWs with L = 1000, (7 = 1 and r € [1,19]. 
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= 0.03 

A^ = 0.04 

A"" = 0.05 

Kf.f 

0.0223 ±0.0005 

0.0279 ±0.0006 

0.0330 ±0.0008 

B 

-0.007 ±0.002 

(-0.0111 ±0.0003) 

-0.009 ±0.002 

(-0.0139 ± 0.0003) 

-0.010 ±0.002 

(-0.0165 ±0.0004) 

ff(0.5) 

0.511 ±0.005 

(0.5167 ± 0.0004) 

0.513 ±0.005 

(0.5209 ±0.0005) 

0.515 ±0.005 

(0.5247 ±0.0006) 

H{1) 

0.507 ±0.005 

(0.5111 ± 0.0003) 

0.508 ±0.005 

(0.5139 ±0.0003) 

0.509 ±0.005 

(0.5165 ±0.0004) 


Table 6: Mean and standard deviation of B, H{0.5) and H{1) computed on 
nMRWs with L = 1000, a = 1 and r € [30, 250]. 


These results confirm the change of the concavity of the scaling exponents 
in the region r G [1,19]. Indeed, we observe in Tab. |6]that, within the 1% 
significance level, all B stay positive. Positive values of B imply the convexity 
of the function ((q), which, in the multifractal picture, is supposed to be concave. 
The region r € [30, 250] is instead much more well-behaved having in all three 
cases concave scaling exponents within the 1% significance level, despite for 
= 0.05 only (the most correlated) the measured B falls slightly outside the 
1% significance level from the expected value. 

5. Analysis of a real dataset 

5.1. Dataset 

The dataset we focused our attention on is the Dow Jones Industrial Average 
(INDU) from 02/01/1900 to 29/12/2000 taken on a daily basis, made of 25, 366 
points. We report in Fig. |T]the scaling of the moments (cfr. Eq. ([T])) respectively 
for T € [1,19] and t £ [30, 250] in blue solid lines along with their linear fit in 
red dashed lines. 




Figure 4: Left panel: scaling of the moments of the INDU time-series with 
r £ [1,19]. Right panel: scaling of the moments of the INDU time-series with 
T £ [30,250]. The values of q are taken in the interval [0.1,1] every 0.1 units, 
increasing from top to bottom in both panels. 
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In Fig. [S]the scaling exponents ({q) are reported again in both regions of r, 
(blue crosses); as it appears evident, the parabolic shape of Eq. [T2](red dashed 
lines) seems to fully capture the empirical behaviour. 




scaling exponents 
-fitted curve 


0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 O.B 0.9 1 
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0.7 


0.6 
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0.2 


0.1 


0 ^ 
0 


+ scaling exponents 
— fitted curve 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

q 


Figure 5: Left panel: scaling exponents of the INDU time-series with r € [1,19]. 
Right panel: scaling exponents of the INDU time-series with t G [30, 250] 


This time-series exhibits power law tails and we computed the decay expo- 

m, 


nents of the tails using the method proposed in 


based on Maximum- 


Likelihood Estimators and the Kolmogorov-Smirnov test. Fig. [6] reports the fit 
of the complementary cumulative distribution of the left and the right tails in 
loglog scale. For the left tail on the x-axis is reported the logarithm of minus 
the negative returns. The estimated values of the tails exponents are 


aieft = 3.20 ± 0.05 a^ght = 3.61 ± 0.06; 


(14) 


they are different within the errors and so the time series exhibits skewness. We 
verified that however skewness has no effects on the measured multifractality. 




Figure 6: Left panel: left tail of the INDU time-series. Right panel: right tail 
of the INDU time-series. 

5.2. Effect of power law tails and autocorrelation in real data 

In order to uncover the source of the multiscaling behaviour of our dataset 
we used two procedures: the shuffling (cfr. [1^), in order to isolate the effects 
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of the power law tails, and the normalization (cfr. [2l|), in order to isolate the 
effects of the autocorrelation. We focused first on the region r € [1,19]. A first 
test we made is a comparison of the scaling exponents of the INDU an a tBM, 
in order to check whether the empirical measured multiscaling behaviour after 
shuffling could be all ascribed to the presence of the power law tails or not. In 
order to do so, we took the INDU time-series and shuffled it 10^ times. On 
every time-series obtained we computed B, H{0.5) and H{1), this allowed us to 
associate a mean and a standard deviation coming from the shuffling procedure. 
We then compared these values with the ones obtained computing B, H{0.5) 
and H{1) on 10^ tBM with the same length of the INDU time-series and tails 
equal to the heavier empirical one, namely aieft- A second test regards checking 
the behaviour of the INDU time-series after normalization in order to test if the 
change of concavity holds for empirical data. We normalized then our time- 
series 10^ times, computing the mean and the standard deviation of B, H{0.5) 
and H{1). The results are reported in Tab. [7]along with the value of B, H{0.5) 
and H{1) computed on the plain time-series. 



INDU 

shuf fled 

normalized 

tBM 

B 

-0.019 

-0.039 ±0.003 

0.0026 ±0.0005 

-0.034 ±0.004 

77(0.5) 

0.552 

0.572 ±0.007 

0.5082 ±0.0008 

0.563 ±0.007 

77(1) 

0.541 

0.551 ±0.006 

0.5092 ±0.0006 

0.546 ±0.007 


Table 7: Plain, shuffled and normalised INDU time-series and a t-Student with 
re [1,19]. 


According to these simulations we confirm previous results that after shuf¬ 
fling the measured multiscaling behaviour of real data increases for r e [1,19] 
(see 22|). Moreover it appears evident that this increased value is statistically 
undistinguishable from the one of the tBM, which is uniscaling. This result 
led us to infer that the multiscaling measured on shuffled empirical time-series 
should be ascribed only to the presence of power law tails. 

The normalised time-series changes its concavity after normalization (stays posi¬ 
tive within the 1% significance level), showing the same issue observed previously 
for the MRW. 

Let us now turn our attention to the region r G [30, 250]; results are reported 
in Tab. [51 



INDU 


normalized 

tBM 

B 

-0.038 

-0.01 ±0.01 

-0.0036 ±0.0007 

-0.014 ±0.007 

77(0.5) 

0.624 

0.53 ±0.03 

0.6244 ±0.0006 

0.52 ±0.02 

77(1) 

0.605 

0.52 ±0.03 

0.6229 ±0.0005 

0.52 ±0.02 


Table 8: Plain, shuffled and normalised INDU time-series with r £ [30, 250]. 


We observe first that the results change considerably. Secondly, within the 
1% significance level the shuffled time-series can be considered uniscaling, as 
it happens for the tBM, so there is not an increase in multifractality. Thirdly 
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the normalized time-series keeps its concavity, thus it is not affected anymore 
by the negative bias mentioned previously. This therefore demonstrates that a 
statistically significant multiscaling behaviour is present in financial time-series. 

6. Discussion 

Our analyses provide clear evidence that the estimation of the scaling ex¬ 
ponents are affected by the aggregation horizon. We chose two regions: 1) 
T G [1,19], which is in line with previous works and 2) r € [30,250]. We 
observed that the analyses on the region r G [1,19] do not reproduce the theo¬ 
retical expectations on time-series exhibiting power law tails or autocorrelation 
structures like the empirical ones. We also found an unexpected concavity of 
the scaling exponents ((q) on tBMs and nMRWs. These results are in line with 
previous observations on real time-series and actually enable us to give them 
an explanation. In particular in the authors argue that the presence of 
autocorrelations in real data can induce a negative bias in the estimation of the 
scaling exponents. According to our interpretation, the change of concavity of 
((g) (reported in Tab. [7|) is exactly the effect of the negative bias. In light of 
this, the increased multiscaling behaviour measured in [22j| after shuffling has 
to be ascribed to the fact that the causal structure of a shuffled time-series is 
destroyed along with the negative bias itself and only the power law tails effect 
is left resulting in an apparent increase of multiscaling. 

For what concerns the region r G [30, 250] we observed that the spurious 
multiscaling found on tBM processes and on the INDU time-series is lower with 
respect to the measurements performed in the r G [1,19] region, being even 
statistically absent for n = 4, 5 and for the INDU as well. Furthermore, the 
convexity of ((q) returns to a concavity, almost removing the negative bias 
effect. We conclude therefore that GHE measurements of multifractality in the 
region r G [30, 250] are reliable and reveal that some degree of multifractality is 
present in real financial log-return time series and it has to be ascribed to the 
effect of the causal structure of the process. 

At this point a question to address is why there is a so big difference in 
the two regions of r. For what concerns the effect of the tails we explain this 
difference via the speed of convergence of the Central Limit Theorem (CLT). In 
particular, for processes exhibiting increments with power law tails, with tails 
index bigger than two, it is well-known that under aggregation they behave, 
in the asymptotic limit, as a BM. The speed of convergence depends on how 
heavy the tails are but if the aggregation is finite, whatever the tails index is, 
there will always be a region in the final part of the tails of the probability 
density which will have a power law behaviour. The effect of increasing the 
aggregation horizon is to push this region further in the tail. This explains why, 
increasing the aggregation horizon, the spurious power law tails concavity tends 
to disappear, reconciling with the theoretical expectations. Counter-intuitively 
processes with increments exhibiting tails with exponents less then two are less 
affected by this problem, since their convergence under aggregation is ruled by 
a generalized Central Limit Theorem and they keep their power law nature 
in the tails of the distribution so the convergence is faster. Concerning the 
autocorrelation negative bias, our interpretation is that it may be caused by 
the fact that the average of a strongly correlated variable does not necessarily 
converge to the expectation value. In this respect the effect might be reduced 
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in the region r G [30, 250] because taking bigger aggregation horizon implies 
averaging over less correlated variables. 

In light of these results we argue that in order to make a reliable measure of 
multifractality, regions of r with a small aggregation horizon should be taken 
with care. Let us however stress that the region r G [30, 250] has not been 
chosen optimizing the performance of the multifractal estimator. However it 
proved to be sufficient to give us valuable insights and improved our estimation 
of the scaling parameters. 

Let us make few other observations concerning the measurements. Since 
the measures, as proposed here (cfr. Subsec. 12.IL depend on two parameters. 
Train and Tmax, we report that in general, Tmi„ rules the precision while Tmax 
the accuracy. So a bigger value of Tmin would reflect in measured values nearer 
to expected ones. On the other hand taking bigger values of Tmax ends up 
in including more oscillating values in the analysis ,thus in a larger standard 
deviation. However for a process like the MRW, attention must be paid, since, if 
Tmin becomes bigger than the autocorrelation length, no multifractal behaviour 
holds anymore, since the increments of the process become independent. So the 
range of r must be taken large enough to reduce as much as possible the power 
law tails effect, but not too much to exceed the time-span where the correlations 
are relevant. Finally, we notice that it appears evident that at small ranges of r 
the power law tails concavity has a bigger impact to the measures with respect 
to the convexity induced by the autocorrelation. 

7. Summary and outlook 

In this paper we studied the multiscaling behaviour of financial time-series 
by studying synthetic and real datasets at different aggregation horizons. We 
started by analysing the MRW, finding that, for small aggregation horizons, 
the multiscaling behaviour after shuffling, appears to increase, in agreement 
with previous works on empirical datasets. However for larger aggregation hori¬ 
zons this effect disappears. Since the shuffling procedure destroys the temporal 
structure of a time-series, but preserves its unconditional distribution, we fo¬ 
cused our attention on the scaling properties of another process, the tBM which 
is a unifractal process. It turned out that for small aggregation horizons the 
presence of power law tails induces a concavity in the scaling exponents, indi¬ 
cating therefore a multiscaling behaviour which is however not predicted by the 
theory. We turned then our attention to the causal structure of a time-series. In 
this case we observed that, at small aggregation horizons, the presence of auto¬ 
correlation introduces a negative bias, i.e. a reduced concavity which ended up 
in a convexity of the scaling exponents, both for synthetic and real time-series. 
These numerical findings explain well the puzzling increase in multifractality 
found in previous works after shuffling: as long as both power law tails and au¬ 
tocorrelation are kept, the spurious multiscaling contribution of the tails is lessen 
by the presence of the autocorrelation, while after shuffling, only the tails effect 
is present. We pointed out that the aggregation of the returns is crucial. Indeed 
for higher aggregation horizons all these issues disappear or at least strongly 
lessen. For what concerns the tails we interpret this effect as a consequence 
of the Central Limit Theorem and its speed of convergence on time-series with 
power law tails but finite variance. In particular the range of tail exponents 
between two and five turned out to affect the most the measurements. This is 
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due to the fact that under aggregation a residual of the power law tail is always 
present in the unconditional distribution and the nearer the exponent is to two, 
the stronger the effect. We finally note that, choosing higher values of aggrega¬ 
tions can reduce this effect but this requires to have longer time-series. We plan 
in the future to study in more detail this issue trying to provide a recipe for the 
best choice of the region of r which is capable to capture the multifractality of 
the empirical time-series. 
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